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DIAGNOSTIC METHOD AND APPARATUS 

This invention relates to diagnostic method and apparatus based upon a 
polymorphism in a TGF-;9 gene. More specifically, this inverttion relates to a 
method for diagnosis of pre-disposition to certain disease states, by screening for 
the presence of this polymorphism. The invention also relates to apparatus for 
screening for the polymorphism. The invention further relates to TGF-^ genes 
containing the polymorphism and to a probe therefor. 

A number of major disease states are associated with or correlated with 
concentration of transforming growth factor (TGF->ff) in circulating plasma. 
Diseases that have been correlated in this way include atherosclerosis, certain 
forms of cancer, osteoporosis and a number of auto immune disorders, such as 
rheumatoid arthritis, multiple sclerosis, systemic lupus erythematosus, late on set 
diabetes and others. 

In a number of cases, therapy is available for these disease states. However, a 
problem common to many of these therapies is that whilst the therapy is capable 
of halting further development of the disease, the therapy is nevertheless not 
capable of reversing or curing the diseased state to a significant extent. 

Hormone replacement therapy is an established treatment for osteoporosis and has 
proved successful in halting further decline in bone density that is characteristic in 
women suffering from this disease. Hormone replacement therapy is generally not, 
however, able to bring about a reversal of osteoporosis, that is to say it is not 
capable of inducing an increase in the bone density of sufferers. 

It would, accordingly, be of particular advantage to be able to identify with 
increased accuracy those individuals having a predisposition or increased 
susceptibility to osteoporosis. Suitable therapy could then be put into place before 
the effects of osteoporosis set in. 
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A similar situation obtains in respect of atherosclerosis and cancer. In the latter 
case, treatment of cancer often involves severe side effects. If it were possible to 
identify individuals with a predisposition or increased susceptibility to cancers then 
there would be advantage in providing those individuals with preventative therapy 
to reduce or prevent or delay the onset of cancer as part of a therapy having 
reduced side effects to those seen with the standard cancer treatments available. 

It is an object of this invention to provide method and apparatus for detecting 
individuals having a predisposition or susceptibility to certain disease states. It is 
a further object of the invention to identify individuals having such a predisposition 
or susceptibility by identifying those individuals with an altered TGF-yff gene. It is 
another object of the invention to provide a therapy for those individuals having a 
predisposition or susceptibility to certain disease states. A still further object of the 
invention is to provide a therapy for those individuals having a predisposition or 
susceptibility to certain disease states that are correlated with concentration of 
TGF-ff in circulation. 

Accordingly, a first aspect of the invention provides a method of diagnosis 
comprising determining genotype of a TGF-)?1 promoter. 

The method of the invention typically comprises determining whether an individual 
is homozygous or heterozygous for a TGF-y?1 promoter and a particular 
polymorphism thereof. The method is conveniently used to screen for an individual 
at risk of a condition or disease correlated with TGF-;? deficiency, such as 
osteoporosis, or correlated with elevated levels of TGF-fi, such as some cancers. 

The DNA sequence of the TGF-j?1 promoter region is known and has been 
published by Kim, S. J. et al, in J. Biol. Chem. 264 (1989) pages 402-408. This 
sequence is referred to hereafter as the wild type sequence or the published 
sequence. The method of the invention determines whether the individual being 
tested has a TGF-^1 promoter which is identical with the published sequence or 
whether that individual has a TGF-^1 promoter which differs from the published 



wo 97/28280 PCT/IB97/0(M25 

- 3 - 

sequence, i.e. is a polymorphism of the published sequence. In carrying out the 
invention, an individual's TGF-^1 promoter genotype is generally determined by 
analysis of a section of the TGF-fi^ promoter, rather than by analysis of the entire 
gene. If the sequence of that section is found to be the same as the corresponding 
section in the wild type sequence, then that individual is classified as having the 
wild type TGF-y91 promoter gene. 

In use of a specific embodiment of the invention to be described below in further 
detail, an individual is screened to determine whether he or she possess a TGF-^1 
promoter which is the published sequence or is a polymorphism thereof in which 
a guanine nucleotide at position -800bp has been replaced by an adenine 
nucleotide. In this specific embodiment, the presence of the polymorphism in 
which guanine is replaced by adenine at position -SOObp correlates with a 
decreased level of TGF-;9 in circulation and with a predisposition to certain disease 
states. As examples of these disease states are included one or more of 
atherosclerosis, cancers, osteoporosis, rheumatoid arthritis, multiple sclerosis, 
systemic lupus erythematosus and other auto immune disorders. 

Screening is carried out, for example, using PGR primers adapted to amplify a 
portion of the TGF-)ff1 gene that includes the nucleotide at position -SOObp. It is 
preferred that the PGR primers are selected so as to amplify a region of the gene 
that surrounds position -SOObp and includes at least six nucleotides on either side 
of this position. PGR techniques are well known in the art and it would be within 
the ambit of a person of ordinary skill in this art to identify primers for amplifying 
a suitable section of the TGF-/?1 gene including position -SOObp. PGR techniques 
are described for example in EP-A-0200362 and EP-A-0201 184. 

In a specific embodiment of the invention described in further detail below, the PGR 
primers have the nucleotide sequences:- 



CCGGGGTGGATTTCCAGGTG {SEQ ID NO: 1} and 
TGCTCTTGAGGACTGTGGGA (SEQ ID N0:2). 
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The screening is suitably carried out by amplifying a DNA fragment including 
position -SOObp of the TGF-^1 gene and determining whether the amplified region 
is cieavable by the restriction endonuclease maelll. Cleavage by maelll indicates 
that the gene is the wild type, while loss of cleavage by maelll indicates that the 
gene is a variant form, i.e. a polymorphism. It is preferred that tht PGR primers are 
selected so as to be homologous with a region of the genome within Ikb of 
position -SOObp on the TGF-ySI gene. 

In a further embodiment of the invention, the diagnostic method comprises analysis 
of the TGF-jffl promoter using single strand conformational polymorphism (SSCP) 
mapping. It is preferred that the PGR primers are selected so as to be homologous 
with a region of the genome within 200bp of position -SOObp on the TGF-/?1 gene. 
It is further preferred that the PGR primers are selected so that position -SOObp is 
substantially towards the middle of the amplified DNA segment. 

In use of a further specific embodiment of the invention, described in greater detail 
below, an individual is screened to determine whether he or she possesses a TGF- 
P^ promoter which is the published sequence or is a polymorphism thereof in which 
a cytosine nucleotide at position -509bp has been replaced by a thymine 
nucleotide. In this specific embodiment, the presence of the polymorphism 
correlates with an elevated level of TGF-/? in circulation and with a predisposition 
to certain disease states, associated with elevated circulating TGF-)?, such as some 
cancers. 

Screening may be carried out using PGR primers to amplify a portion of the TGF-^1 
gene around position -509. Examples of suitable primers are; 

GAGACTCTAGAGACTGTGAG (SEQ ID NO: 3) and 
GGTCACCAGAGAAAGAGGAC (SEQ ID NO: 4). 

The -509 polymorphism can be detected using Bsu 361, loss of cleaving indicating 
presence of the polymorphism. 
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Alternatively, the -509 polymorphism is detectable using SSCP techniques. 



A second aspect of the invention provides diagnostic means comprising PGR 
primers adapted to amplify a region of a TGF-yffI promoter, preferably a DNA 
segment comprising a nucleotide at position -800bp, or a segment comprising 
position -509bp, on the TGF-^1 gene. It is preferred that the PGR primers are 
adapted to amplify a DNA segment that is up to 2kb in length, more preferably up 
to Ikb in length. In a particular embodiment of the invention the segment is 
approximately 400bp in length. 

In specific embodiments of the invention described below, the PGR primers are 

(i) CCGGGCTCGATTTGCAGGTG {SEQ ID N0:1) and 
TGCTGTTGACGAGTGTGCCA (SEQ ID NO:2). or 

(ii) CAGACTCTAGAGACTGTGAG (SEQ ID NO: 3) and 
GGTCAGGAGAGAAAGAGGAG (SEQ ID NO: 4). 

Optionally, the diagnostic means further comprises means to determine which 
nucleotide is found at position (a) -SOObp or (b) -509bp on the TGF-yffI gene. 
Examples are the restriction endonuclease maelll for (a) and the restriction 
endonuclease Bsu 341 for (b). As will be appreciated by a person of skill in the art, 
an alternative restriction endonuclease that is for example able to cleave the DNA 
segment when a guanine nucleotide is at position -SOObp and is not able to cleave 
the segment when an adenine nucleotide is at position -SOObp is also suitable. 
Alternatively, a suitable agent would cleave the segment when the nucleotide at 
position -SOObp is adenine and not cleave the segment when this nucleotide is 
guanine, and likewise for the cytosine/thymine polymorphism at position -509bp. 
The invention further provides a diagnostic kit comprising diagnostic means 
according to the second aspect of the invention, optionally within a container. 

A third aspect of the invention provides DNA probes comprising a sequence 
selected from SEQ ID NO:s 1-4. 
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A fourth aspect of the invention provides a TGF-^1 gene in which a guanine 
nucleotide at position -SOObp is replaced with an adenine nucleotide. 

A fifth aspect of the invention provides a TGF->91 gene in which a cytosine 
nucleotide at position -509bp is replaced with a thymine nuclecvtide. 

The present invention is based upon the discovery of a single base polymorphism 
in the TGF-^1 promoter. An aspect of the invention is that the polymorphism is 
correlated with a predisposition to a number of disease states including, in 
particular, atherosclerosis, cancers, osteoporosis, rheumatoid arthritis, multiple 
sclerosis, systemic lupus erythematosus, and other immune disorders. The 
invention is of advantage in that by screening for the presence of the polymorphism 
it Is possible to identify individuals likely to have a genetic predisposition to one or 
more of these disease states. 

Accordingly, a sixth aspect of the invention provides a method of therapy 
comprising screening an individual for a predisposition to osteoporosis and, if a 
genetic predisposition is identified, treating that individual to delay or reduce or 
prevent the osteoporosis. 

A suitable treatment to prevent or reduce or delay osteoporosis is hormone 
replacement therapy. The use of this therapy is well known in the art. According 
to the invention, hormone replacement therapy can thus be commenced in 
individuals likely to have a predisposition to osteoporosis but in whom osteoporosis 
has not yet begun to any significant extent. 

It is believed that the use of hormone replacement therapy carries with it a 
concomitant increased risk of breast cancer. The invention offers the advantage 
that the increased risk of breast cancer associated with hormone replacement 
therapy can be accepted only by those women who are known to have a likelihood 
of predispos'rtton to osteoporosis. In an embodiment of the sixth aspect of the 
invention, the predisposition of an individual to osteoporosis is assessed by 
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determining whether that individual is homozygous for the wild type TGF-/91 gene, 
is heterozygous for the wild type and the polymorphism in which guanine at 
position -800bp is replaced by adenine, or is homozygous for the polymorphism. 

According to the invention, an individual who is homozygous for »he polymorphism 
is classified as being at highest risk. An individual being heterozygous is classified 
as having moderate risk. An Individual being homozygous for the wild type TGF-;ffl 
gene is classified as being In the lowest risk category. 

Optionally, the assessment of an Individual's risk factor is calculated by reference 
both to the presence of a TGF-;?1 promoter polymorphism and also to other known 
genetic or physiological or dietary or other indications. The invention in this way 
provides further information on which measurement of an individual's risk can be 
based. 

In a seventh aspect of the invention there is provided a method of therapy in which 
a predisposition of an Individual to atherosclerosis is determined and then, if a 
predisposition is confirmed, that person is treated to prevent, reduce or delay 
atherosclerosis. The predisposition to atherosclerosis is assessed, in an 
embodiment of the invention, using the same criteria as for measurement of 
predisposition to osteoporosis in the above aspect of the invention. In an 
embodiment of the seventh aspect, the method comprises determining whether an 
individual has a predisposition to atherosclerosis and, treating an Individual with 
such a predisposition to prevent, reduce or delay atherosclerosis. For example, the 
treatment could comprise alterations in that individual's diet. Another treatment 
could be pharmaceutical, such as by administration of an anticoagulant agent or a 
fibrinolytic agent. Specific treatments and methods are disclosed in US patents 
5242397 and 5171217, the contents of which are incorporated herein by 
reference. 

In further aspects, the invention provides analogous therapies against cancer and 
auto-immune disorders, based upon identification of a TGf-P^ polymorphism 
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correlated with a predisposition to these diseases. In the case of cancer, disease 
may result from either too much TGF-/3 or too little. Consequently, one such 
polymorphism is replacement of a cytosine at position -509bp by thymine; another 
is replacement of a guanine at -800bp by an adenine. In the case of auto-immune 
disorders, expected to be associated with lowered TGF->ff, the fV)lymorphism at - 
SOObp in which adenine replaces guanine would be correlated with predisposition 
to disease; the polymorphism at -509bp, in which thymine replaces cytosine would 
be negatively correlated with disease. 

The concentration of transforming growth factor /? (TGF-^) in circulating plasma has 
been correlated with the development of several major diseases, including 
atherosclerosis and certain forms of cancer. However, the mechanisms which 
control the concentration of TGF-^ in plasma are poorly understood. The present 
invention is based upon our discovery that the concentration of TGF-yff in plasma 
(active TGF-^ plus plasma latent TGF-;ff forms) is predominantly (79%) under 
genetic control in 66 out of 71 pairs of monozygotic female twins who were post- 
menopausal. The concentration of active TGF-^ was also significantly under 
genetic control (57%). Analysis of the TGF-;S1 promoter by single strand 
conformational polymorphism (SSCP) mapping has identified a single base 
polymorphism (A/G at position -SOObp) which is significantly associated (p < 0.01 ; 
n = 276) with lower levels of TGF-/9 in plasma. These data suggest that 
predisposition to atherosclerosis or various forms of cancer may be linked to 
particular alleles within the TGFb 1 locus. 

TGF-;? is a multifunctional cytokine which regulates the proliferation and 
differentiation of a wide variety of cell types in vitro. Recently, pathological 
misregulation of the TGF-^ system has been implicated in the development of 
several major diseases, including various forms of cancer, atherosclerosis and 
fibrotic disease. Moses and colleagues showed that local expression of a 
constitutively active TGF-y?l transgene prevented the development of mammary 
carcinoma induced either by transgenic overexpression of TGF-a or by the chemical 
carcinogen DMBA ((1995) Proc. Natl. Acad. Sci USA 92). Similarly, studies from 
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the groups of Akhurst and Balmain have demonstrated that low levels of TGF-/?1 
staining is prognostic for a high risk of malignant conversion of benign tumours in 
the p53 knockout mouse ((1994) Cancer Res. 54, 5831-5836). More recently, 
Markowitz et al Science 268, 1336-1338, 1995, have identified somatic 
mutations of the TGF-y?type II signalling receptor, which would l^e likely to render 
it non-functional, in human colon cancer biopsies. These data suggest that local 
decreases in TGF-ff activity may be involved in transformation to the malignant 
phenotype in vivo. By contrast, Arteaga et al. J. Clin. Invest. 92, 2569-2576, 
1995, have shown that transfecting a transformed cell line with a construct 
expressing active TGF-y? rendered the cells more tumorigenic in vivo, probably 
because TGF-;9 is immunosuppressive and immune surveillance was compromised. 
Consistent with this observation, elevated levels of plasma TGF-yff in patients with 
malignant prostatic tumours and hepatocellular carcinoma have been reported. 
While local suppression of TGF-;ff activity may result in malignant conversion, 
elevated plasma levels of TGF-fi are correlated establishment of the tumour. 

In studies on the role of TGF-^ in atherogenesis, we have shown that mice 
expressing the apolipoprotein(a) transgene develop diet-induced lipid lesions 
resembling early human atherosclerotic plaques at sites in the vessel wall where 
TGF-fi activity is locally depressed by high concentration of apolipoprotein(a). It 
has also been shown that tamoxifen, which elevates TGF-^ activity both in the 
vessel wall and serum of mice, will prevent diet-induced lipid lesion formation in 
both apolipoprotein{a) mice and in C57BI6 inbred mice. Consistent with these 
observations we have shown that the concentration of active TGF-^ is depressed 
by five fold in Individuals with severe coronary atherosclerosis compared to 
individuals with normal coronary arteries determined by angiography. Taken 
together, these studies suggest that decreased TGF-/ff activity, either in the vessel 
wall or in the circulation, is an important step in the development of 
atherosclerosis. Despite these correlations with major diseases, the mechanisms 
which control the concentration of TGF->S in circulating plasma are poorly 
understood. In the present invention we have examined whether there is genetic 
regulation of plasma TGF-fi concentrations. 
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TGF-y? protein in man is derived from three unlinked genetic loci, TGFbl, TGFb2 
and TGFb3 which express three protein isoforms TGF-;?1 , $2 and ;S3. Studies using 
isoform-specific ELISA assays have demonstrated that TGF-/92 is not present in 
human blood and TGF-yS3 has been detected in platelet-poor plasma and serum 
from only 2/22 Individuals tested (unpublished observations of the inventors). 
Most or all of the TGF-^ present in blood from most individuals is therefore the b1 
"rsoform. Recent studies have shown that TCF-^ffl in blood is present in several 
different protein complexes. The highest concentrations of TGF-^1 are contained 
in the platelets, where it is present as two distinct platelet latent complexes, 
termed large and small latent complexes, which have no known biological activity. 
In contrast, platelet-poor plasma contains a biologically active form of TGF-y?1 and 
a latent complex, which is distinct from either of the platelet latent complexes, that 
we have termed plasma latent complex. The same forms of plasma TGF-)9 are 
present in serum, together with much larger concentrations of platelet large latent 
complex which is released into serum when clotting occurs. For our present 
studies of the genetic control of plasma TGF-/9 in populations of twins, an ELISA 
was used which detects only the two forms of plasma TGF-yS described above: the 
active form together with the plasma latent complex. The capture antibody 
(BDA1 9; R&D Systems) does not recognise the platelet large latent complex which 
is released into serum and consequently the BDA19 ELISA detects the same 
concentrations of TGF-;J in plasma or serum prepared from the same blood sample. 

There now follows description of specific embodiments of the invention illustrated 
by figures 1-4 in which:- 

Fig 1 shows the circulating concentration of TGF-^ in twin pairs. The concentration 
of active plus plasma latent TGF-jS {a,b) and active TGF-y? {c,d) was measured in 
serum from monozygotic (a,c) and dizygotic (b,d) twins. All individuals were 
female, post-menopausal and not receiving hormone replacement therapy. For 
each pair, twin 1 was arbitrarily designated as the sibling with lowest circulating 
concentration of TGF-/?. 
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Fig 2 shows a polymorphism in the TGF-^1 promoter associated with circulating 
TGF-;9 concentration, (a) Single strand conformation polymorphism (SSCP) 
acrylamide gel of non-denatured DNA obtained by polymerase chain reaction 
between primers 5'-CCCGGCTCCATTTCCAGGTG-3' (-1 106 to -1 125bp) (SEQ ID 
NO: 1) and 5'-TGCTCTTGACCACTGTGCCA-3' (-738p to -757bp) (SEQ ID NO: 2) 
with lymphocyte-derived genomic DNA from four healthy Caucasian donors as the 
template. One heterozygote (lane 1 ) is shown with the diagnostic doublet indicated 
by an arrow. <b) Diagrammatic representation of the putative transcription factor 
binding sites in the TGFbl promoter. Hatched box : one or more consensus spl 
binding sites; vertical line : consensus ap2 binding site; stippled oval : consensus 
glucocorticoid response element; filled box : consensus CREB half-site. A portion 
of the wild type sequence is shown - CTCTGCCTCCAACGTCACCACCAT = SEQ 
ID NO: 5. The sequence of the CREB half-site is shown (boxed) with the single 
base polymorphism that resulted in the SSCP doublet shown in (a) marked in bold 
script. The maell{ consensus sequence present in the G allele (GTNAC) is 
underlined. All nucleotide positions are related to the most 5' transcriptional start 
site of the TGFbl gene described in J Biol Chem, 264, pages 402-408. 1989. 
(c) Distribution of circulating concentration of TGF-yS in those twin pairs analyzed 
in Fig. 1 who have no A allele identified by the absence of undigested DNA in the 
maelll digestion of the PCR fragment obtained in (a) above, (d) Distribution of the 
circulating concentration of TGF-fi in individuals with AG genotype. 

Fig. 3 shows the location of polymorphisms in the TGFbl promoter, and includes 
a diagrammatic representation of the putative transcription factor binding sites in 
the TGFbl promoter. Hatched box : one or more consensus spl binding sites; 
vertical line : consensus ap2 binding site; stippled oval : consensus nuclear 
hormone binding element; filled box : consensus CREB half-site. Portions of the 
wild type sequence are shown -CTCTGCCTCCAACGTCACCACCAT = SEQ ID NO: 
5, CCCTTCCATCCCTCAGGTGTCCT = SEQ ID NO: 6. The sequence of the CREB 
half-site is shown (boxed) with the single base polymorphism at -800bp marked in 
bold script. The maelll consensus sequence present in the G allele (GTNAC) is 
underlined. The sequence surrounding the single base polymorphism at -509bp 
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(bold script) is also shown. All nucleotide positions are related to the most 5' 
transcriptional start site of the TGFbl gene. 

Fig 4 shows association between polymorphisms in the TGFbl promoter region and 
circulation concentration of TGF-fi Boxplots (inter-quartile range) of (a + l) (a,c) 
TGF-^ concentrations and active (b,c) TGF-y? concentrations in groups of subjects 
sorted by genotype at -800bp |a,b) and -509bp {c,d). 

Example 1 

TGF-y? was assayed by BDA19 ELISA in serum samples from 136 pairs of twins (71 
monozygotic and 65 dizygotic pairs) to assess the genetic contribution to variation 
in TGF-/9 concentration. The concordance in concentration between the 
monozygotic siblings was very high (sn/mean = 16.9% +/- 2.3 %; n = 66) in 
66/71 pairs. For each of the remaining five pairs, one sibling had an elevated level 
of circulating TGF-fi which was higher than in any of the other 66 twin pairs and 
was more than five standard deviations from mean of the population as a whole. 
It is very likely that these individuals had elevated concentrations of circulating 
TGF-fi due to environmental influences and these five twin pairs were excluded 
from further analysis. For the remaining monozygotic twin pairs, the intraclass 
correlation coefficient (r^MZ) was 0.79 (Fig la). 

Using the same exclusion criteria for the dizygotic twins, there were two twin pairs 
in which one sibling had a concentration of TGF-fi greater than for any of the 
concordant monozygotic twin pairs and more than five standard deviations from the 
mean of the whole population. For the remaining 63 dizygotic pairs the correlation 
was less marked (intraclass correlation coefficient r^D2 =0.39; Fig 1 b) than for the 
monozygotic twin pairs. If genetic variation at a single locus were responsible for 
the variation in TGF-;ff concentration, the r^DZ to r^MZ ratio would be predicted to 
be 0,5 and as the number of unlinked alleles contributing to the control of TGF-/S 
concentration increases, the r^DZ to r^MZ ratio decreases. In this study r^MZ / r^DZ 
= 0.49, suggesting that genetic variability at very few unlinked loci, or possibly at 
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is responsible for the genetic control of circulating TGF~fi. 



The concentrations of active TGF-/ff in the serum samples were also measured using 
an ELISA in which the extracellular domain of the type II JGF-fi receptor (R2X) is 
the capture agent. The concentration of active TGF-^ in the circulation was also 
significantly correlated for the monozygotic twin pairs (r^MZ =0.57; Fig 1c), but 
much less so in the dizygotic twins (r^DZ = 0. 1 1 ; Fig 1 d). Since the r^DZ to r^MZ 
ratio is much lower than 0.5 we conclude that a number of unlinked genetic loci 
are likely to contribute to the variation in the concentration of active TGF-fi. This 
is consistent with our previous observation that activation of TGF-)5 was inversely 
correlated with the concentration of lipoprotein(a), since the circulating 
concentration of Lp(a) is genetically determined at a locus unlinked to TGFbl. 

Since most of the TGF-^ in plasma is the b1 isoform we investigated whether 
mutations at the TGFbl locus influence the circulating concentration of TGF-^. 
Using SSCP mapping a polymorphism was identified in the promoter region of the 
TGFbl gene approximately 1 kb upstream from the transcriptional start site (Fig 
2a, b). Sequencing of the PGR fragment analyzed by SSCP mapping identified the 
polymorphism as a single base change at position -SOObp, with A replacing the G 
in the published genomic sequence ((1989) J. Biol. Chem. 264, 402-408) in 
approximately 10% of alleles. The presence of adenine at position -SOObp 
destroyed a restriction site for the endonuciease maelll which was therefore used 
to scan the promoter from the 1 36 twin pairs for the presence of this 
polymorphism. 52/276 individuals were heterozygous for this polymorphism and 
one monozygotic twin pair (2 individuals out of 276) were homozygous for A at 
position -SOObp. The polymorphism was therefore present in this population at a 
frequency of 9.8% of alleles tested. 

We tested whether the individuals with rare A allele had circulating levels of TGF-^ 
significantly different from the more common G allele. The mean TGF-/? serum 
concentration assayed by BDA19 ELISA in individuals with GG genotype was 6.4 
+ /- 0.6 ng/ml (n = 252; Fig 2c), not significantly different from the mean TGF-^ 
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concentration of 8.9 + /- 1 .5 reported for a group of female donors of similar mean 
age. By contrast, the mean TGF-;e concentration in individuals with AG genotype 
was 4.2 4-/-1.1 ng/ml(p<0.01; n = 62; Fig 2d). The siblings of the monozygotic 
twin pair homozygous for the A allele had TGF-^ concentration of < 1 ng/ml (below 
the detection limit of the assay used) and 2ng/ml. Kruskal-Walis ;est for difference 
between GG and AG genotypes is P = 0.01 taking all twins into account as 
independent individuals. We conclude that the presence of the A allele is 
significantly associated with lower circulating concentrations of TGF-yS. 

The polymorphism described is present in a consensus CREB half-site and the A 
allele would be expected to have reduced affinity for the CREB family of 
transcription factors. The polymorphism described may therefore be directly 
responsible for the different concentrations of TGF-^ or alternatively it may be in 
linkage disequilibrium with other as yet unidentified polymorphisms in the TGFbl 
locus which are affecting the concentration of TGF-;?. Taken together, the data 
suggest that a major part of the genetic control over circulating levels of TGF-y? may 
reside in the TGFbl locus, since only a very few unlinked loci, or possibly only a 
single locus, are implicated in the control mechanism and we have identified a 
polymorphism in the TGFbl locus which is correlated with TGF-fi concentration. 

Example 2 

The present Example follows on from the work reported in the first Example above. 
Subjects 

All the subjects tested here were female (age range 39 to 70 years: mean 57.7 years) 
and the majority were post-menopausal. Twins were recruited foltowing a national 
media campaign and were broadly representative of the normal United Kingdom 
population. None of the subjects were taking hormone replacement therapy or other 
hormonally active medications. Serum was prepared and stored for TGF-iS analysis. 
Lymphocyte DNA was also prepared for each subject using the standard phenol 
extraction method and zygosity status was detemnined by questionnaire and multiplex 
fingerprinting. 
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TGF-g analysis 

Active plus acid-activatable latent (a+I) TGF->3 was measured using the BDA19 
capture ELISA. Active TGF-0 was measured using the R2X capture ELISA. A single 
determination of each sample was made. The intra-assay co-efficient of variation of 
the assay used are 6.8% and 7.4% respectively. Semm samples with TGF-P 
concentration below the detection limit of the assays used (<1ng/ml; 27/334 for (a+I) 
TGF-^ and 37/334 for active TGF-;ff) were assigned a value of 0.5ng/ml. 



Polymerase chain reaction (PGR) was used to amplify fragments of the promoter that 
were approximately 400bp long. 1.0mg of genomic DNA was added to a 20ml 
reaction consisting of 1x Taq Polymerase Buffer (Phamiacia). SOnmol of each dNTP. 
and 6.25pmol forward primer. The reaction was heated to 95 degrees C for 30s and 
then held at 80 degrees C while 5ml containing 6.25pmol reverse primer and 0.5 units 
Taq Polymerase in 1x Taq Polymerase Buffer was added. Samples were amplified 
for 35 cycles of 95 ITCH for 1 minute, 60 ITCH for 1 minute and 72 ITCH for 1 
minute, followed by an extension period of 10 minutes at 72 ITCH. Primers were 5- 
CCCGGCTCCATTTCCAGGTG-3"(SEQ ID NO: 1) and 5- 
TGCTCTTGACCACTGTGCCA-3'{SEQ ID NO: 2) for G-800A polymorphism and 5- 
CAGACTCTAGAGACTGTCAG-3'(SEQ ID NO: 3) and 5'- 
GGTCACCAGAGA/\AGAGGAC-3'(SEQ ID NO: 4) for C-509T polymorphism. PGR 
products were denatured with 0.2M sodium hydroxide and 0.5mM disodium 
ethylenediaminetetraacetic acid at room temperature. Single stranded DNA bands 
were resolved on a 20% acrylamide gel (Phastsystem, Pharmacia or XCell II system, 
R&D Systems) and visualised by silver staining. 



Genotypinq 

G-800A polyPDorphism abolishes a Mae III restriction site. Mae III and Mae III Buffer 
(Boehringer) were added directly to the PGR products to a final volume of 30 mi. and 
incubated at 55 ITCH for 5 hours. Digests were resolved on a 1.2% agarose gel. 
C-509T polymorphism is in a Bsu 36 I restriction site. PGR products were precipitated 
with 3 volumes 96% ethanol and 1/10 volume 3 M Sodium Acetate pH 5.2 at -20 
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ITCH for 1 hour, followed by centrifugation at 13.000 r.p.m. for 10 mins. The DNA 
was resuspended in a 20 ml Bsu 36 I digest containing 10 units Bsu 36 I (New 
England Biolabs) and incubated at 37 ITCH for a minlnnum of 12 hours. Digests were 
resolved by 1.5% agarose gel electrophoresis. 

TGF-iS was assayed by BDA19 ELISA in serum samples from 174 pairs of twins (87 
monozygotic (MZ) and 67 dizygotic (DZ) pairs) to assess the genetic contribution to 
variation in TGF-fi concentration. The baseline characteristics of these twins are 
shown in Table 1 . The MZ twins were slightly older than the DZ twins (mean age 
58.9 years versus 56.6 years) and a significantly higher proportion were post- 
menopausal (87% versus 75%). However, neither of these variables were 
significantly associated with the concentration of (a+l) or active TGF-ff. Consistent 
with our previous studies, the plasma (a+l) TGF-^ concentrations of this population 
did not conform to a normal distribution. A boxcox transformation was applied to 
maximise normality, which resulted in equal means and variances between the MZ 
and DZ groups but the distributions remained significantly non-normal. We therefore 
treated TGF-fi concentration as a categorical variable, assigning each individual into 
an approximate fertile (for (a+l) TGF-;ff these were 'low', less than 4ng/ml; 'middle' 
4-5ng/ml; 'high' more than 5ng/ml). For (a+l) TGF-/?, the percent concordance (that 
is the proportion of twin pairs where both individuals fell into the same fertile) for MZ 
pairs (57.8%) was significantly higher than for DZ pairs (40.5%; p = 0.025; Chi-square 
test; Table 2). This demonstrates that the concentration of (a+l) TGF-j? in plasma is 
influenced by genetic factors. 

The concentrations of active TGF-fi in the serum samples were also measured using 
an ELISA in which the extracellular domain of the type II TGF-fi receptor (R2X) as 
the capture agent. The distribution of active TGF->7 concentrations for both groups 
of twins were significantly different from a normal distribution, even after boxcox 
transformation, as for the distribution of (a+l) TGF-;? concentrations. Treating the 
active TGF-fi concentration as a categorical variable ('low' less than 3ng/ml; 'middle' 
3 or 4ng/ml; 'high' greater than 4ng/ml). the percent concordance l)etween tertiles 
was again significantly higher for MZ twins (62.8%) than for DZ twins (43.6%; p = 
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0.016; Chi-square test; Table 2). It appears, therefore, that the concentration of 
active TGF->S in plasma is also under genetic control. 

Since most of the TGF-;? in plasma is the TGF-iff1 isoform we investigated whether 
mutations at the TGFbl locus influence the circulating concentratioo of TGF-)S. Using 
SSCP analysis two polymorphisms were identified in the promoter region of the 
TGFbl gene within I.Skbp upstream from the major transcriptional start site. 
Sequencing of the PGR fragments analyzed by SSCP mapping identified two single 
base substitution polymorphisms. The first polymorphism occurred at -800bp, with 
adenine replacing guanine in the published genomic sequence in approximately 10% 
of alleles (G-800A; Fig 3). The second polymorphism was at -509bp with thymine 
replacing cytosine in approximately 30% of alleles (C-509T; Fig 3). The genotype at 
these two sites in the TGFbl promoter region was determined for the majority of the 
individuals in the twin study. For the G-800A polymorphism we found 59/341 
individuals were heterozygous (AG) and two individuals (one monozygotic twin pair) 
were homozygous AA, corresponding to an A allele frequency of 0.092 in this 
population. For the C-509T. 160/330 individuals were heterozygous (CT), and 24/330 
individuals were homozygous for thymine at this position (T allele frequency = 0.315). 
The G-800A polymorphism is in Hardy-Weinberg equilibrium within the twin study 
(c2=0.345. p =0.557), however the C-509T polymorphism does not exhibit Hardy- 
Weinberg equilibrium when all twins are included (c2=5.012, p=0.025). 

We tested whether there was any association between the TGFbl promoter genotype 
and plasma concentration of (a+l) TGF-yff. The presence of an A allele is significantly 
associated with lower circulating concentrations of TGF-/?, irrespective of whether all 
individuals are included (tied p = 0.0005; n =328; Kruskal-Wallis test) or if only one 
individual selected at random is included from each twin pair (tied p = 0.02; n=164; 
Kmskal-Wallis test; Fig 4a). The presence of the T allele at -509bp is significantly 
associated with higher plasma concentrations of TCF-fi (tied p = 0.0004; n=319; 
Kruskal-Wallis test), and this association is still significant if only one twin from each 
pair is Included in the analysis (tied p=0.016. Kruskal-Wallis test; Fig 4c). The 
G-800A and C-509T mutations are in repulsion, as genotypes such as AGTT, which 



wo 97/28280 PCT/IB97/00425 

- 18 - 

would require adenine and thymine to be in linkage have never been observed. 



It was observed that (Fig 4b,d) the T allele at -509bp is significantly associated with 
higher concentrations of active TGF-0 (p=0.023; Kruskal-Wailis test with only one twin 
from each pair included) and the A allele -SOObp is likely to be asspciated with lower 
concentrations of active TGF-y? (p=0.076; Knjskal-Wallis test with only one twin from 
each pair included). 

The presence of polymorphisms in the TGFbl locus may indicate predisposition to 
diseases that have been linked to circulating levels of TGF-/?, including 
atherosclerosis (decreased circulating TGF-/ff1) and some forms of cancer (elevated 
circulating TGF-;ff1 ). 

The inventk>n thus provides method and apparatus for identification and treatment of 
individuals having a TGF-)ff1 polymorphism, correlated with a predisposition to certain 
disease states. 
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TABLE 1 

MZ (n=174) DZ (n=174) p 



Age (years) 58.9 ± 6.6 56.6 ± 8.3 0.006 

Height (cm) 160.2 ±6.1 161.3 ± 5.9 0.078 

Weight (kg) 62.9 ± 9.7 65.0 ± 11.0 0.063 



No. (%) post-menopausal 131 (87) 112 (75) 0.007 

No. (%) current smoker 24 (14) 33 (19) 0.325 

No. (%) on no alcohol 23 (13) 15 (9) 0.121 

No. (%) previous HRT use 32 (18) 31 (18) 0.889 

No. (%) hysterectomy 35 (20) 37 (21) 0.791 



Baseline characteristics of population. Only age and the number of post-menopausal 
individuals differed between the monozygous (MZ) and dizygous (DZ) groups (p 
values highlighted in bold). None of the individuals studied were currently on 
hormone-replacement therapy or taking other hormonally active medications. 
Abbreviation : HRT. hormone replacement therapy. 
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TWIN 2 

'Low* 'Middle' 'High' 

'Low' MZ 12% 12% 4% 

T DZ 11% 14% 5% 

W 

I 'Middle' MZ 4% 17% 17% 

N DZ 11% 17% 13% 

1 'High' MZ 2% 4% 29% 

DZ 6% 11% 13% 



(a) (a+I) TGF-^ 



(b) active TGF-/3 

TWIN 2 









'Low' 


•Middle' 


'High' 




'Low' 


MZ 


12% 


5% 


4% 


T 




DZ 


18% 


14% 


3% 


W 












I 


•Middle' 


MZ 


6% 


30% 


14% 


N 




DZ 


8% 


18% 


12% 


1 


'High' 


MZ 


3% 


5% 


22% 






DZ 


8% 


12% 


8% 



Within-pair concordance of (a+I) and active TGF-/ff concentrations t>etween MZ and 
DZ twin pairs. The concentration of (a+I) (a) or active (b) TGF-0 was treated as a 
categorial variable, and individuals assigned to approximate tertiles (for (a+I) TGF-fi 
: 'low', <4ng/ml; 'middle' 4-5ng/ml; 'high' >5ng/ml and for active TGF-fi : 'low', 
<3ng/ml; 'middle' 3-4ngyml; 'high' >4ng/ml). The percentage of twin pairs in each of 
the MZ and DZ groups are shown separately for each combination of TGF-fi 
concentration categories. The percentage patrwise concordance for each group is the 
proportion of twin pairs on the diagonal of each table. The percentage pairwise 
concordance is significantly higher from MZ than DZ twins for both (a+I) TGF-fi 
(p=0.025; Chi-square test) and active TGF-fi (p=0.016; Chi-square test). Where the 
proportion of twins in the cell differs significantly between the MZ and DZ groups 
(p<0.05; Chi-square test) the percentages are highlighted in bold. 
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1. A method of diagnosis comprising determining genotype of a TGF-/J1 
promoter. 

2. A method of diagnosis according to Claim 1 comprising determining whether 
an individual possesses a wild type promoter. 

3. A method of diagnosis according to Claim 1 or 2 comprising determining 
whether an individual possesses a wild type TGF->ff1 promoter or a variant 
promoter that differs from the wild type by at least a single nucleotide 
substitution. 

4. A method according to daim 3 wherein the variant differs from the wild type 
In that cytosine at position -509bp is substituted by thymine. 

5. A method according to Claim 3 wherein the variant differs from the wild type 
in that guanine at position -800bp is substituted by adenine. 

6. A method according to any of Claims 1 -5 comprising amplifying a portion of 
that Individual's TGF-)ff1 gene using PCR techniques. 

7. A method according to any preceding claims for diagnosis of a disease 
selected from osteoporosis, atherosclerosis, cancer and immune disorders. 

8. Diagnostic means, comprising means for determining genotype of a TGF-jJ1 
promoter. 

9. Diagnostic means according to Claim 8, comprising PCR primers adapted to 
amplify a region of a TGF-/?1 promoter. 

10. Diagnostic means according to Claim 9 comprising PCR primers adapted to 
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amplify a DNA segment comprising a nucleotide at position -800bp on a TGF- 
/?1 gene. 

1 1 . Diagnostic means according to Claim 9 comprising PGR primers adapted to 
amplify a DNA segment comprising a nucleotide at position 509bp on a TGF- 
p\ gene. 

12. Diagnostic means according to Claim 9 wherein the PCR primers are selected 
from (i) SEQ ID NO: 1 and SEQ ID NO: 2 and (ii) SEQ ID NO: 3 and SEQ ID 
NO: 4. 

13. Diagnostic means according to any of Claims 8-12 further comprising an 
enzyme such as a restriction endonuclease capable of cleaving, a wild type 
TGF-^1 pronr«)ter at position -800bp and not capable of cleaving a variant of 
the wild type pronrtoter in which the nucleotide at position -800bp is adenine. 

14. Diagnostic means according to any of Claims 8-12 further comprising an 
enzyme such as a restriction endonuclease capable of cleaving a wild type 
TGF->?1 pronrater at position -509bp and not capable of deaving a variant of 
the wild type promoter in which the nucleotide at position -509bp is thymine. 

1 5. DNA comprising a TGF-^1 gene, or a fragment thereof at least 15 nucleotides 
in length. In which guanine at position -SOObp is substituted by adenine, or 
DNA complementary thereto. 

1 6. DNA comprising a TGF-/ff1 gene, or a fragment thereof at least 1 5 nucleotides 
in length, in vi^ich cytosine at position -509bp is substituted by thymine, or 
DNA complementary thereto. 

17. A method of osteoporosis therapy comprising:- 



screening an individual for a genetic predisposition to osteoporosis: and 
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if such a predisposition is identified, treating that individual to prevent or 
reduce osteoporosis or to delay the onset of osteoporosis. 

18. A method according to Claim 17 wherein a predisposition to osteoporosis is 
correlated with a TGF->71 promoter gene which varies from the wild type 
promoter sequence. 

19. A method according to Claim 17 or 18 in which a predisposition to 
osteoporosis is correlated with a TGF-01 promoter in which at position -800bp 
a guanine nucleotide is substituted by an adenine nucleotide. 

20. A method according to any of Claims 17-19 comprising treating the individual 
by hormone replacement therapy. 

21. A method of atherosclerosis therapy comprising:- 

screening an individual for a genetic predisposition to atherosclerosis; and 

rf such a predisposition is identified, treating that individual to prevent or 
reduce atherosclerosis or to delay the onset of atherosclerosis. 

22. A method according to Claim 21 wherein a predisposition to atherosclerosis 
is correlated with a TGF-/J1 promoter gene which varies from the wild type 
promoter sequence. 

23. A method of cancer therapy comprising:- 

screening an individual for a genetic predisposition to cancer; and 



if such a predisposition is identified, treating that individual to prevent or 
reduce cancer or to delay the onset of cancer. 
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24. A method according to Claim 23 wherein a predisposition to cancer is 
correlated with a TGF-/?1 promoter gene which varies from the wild type 
promoter sequence. 

25. A method according to Claim 23 wherein a predisposition to cancer is 
correlated with a TGF-;ff1 promoter gene in which cytosine at position -509bp 
is substituted by thymine. 

26. A method according to Claim 23 in which a predisposition to cancer is 
correlated with a TGF-iff1 promoter in which at position -SOObp a guanine 
nucleotide is substituted by an adenine nucleotide. 

27. A method of immune disorder therapy comprising:- 

screening an individual for a genetic predisposition to immune disorder; and 

if such a predisposition is identified, treating that individual to prevent or 
reduce immune disorder or to delay the onset of immune disorder. 

28. A method according to Claim 27 wherein a predisposition to immune disorder 
is correlated with a TGf-0^ pronrrater gene in which cytosine at position - 
509bp is substituted by thymine. 

29. A method according to Claim 27 in which a predisposition to immune disorder 
is correlated with a TGF-/?1 promoter in which at position -SOObp a guanine 
nucleotide is substituted by an adenine nucleotide. 

V 

30. A diagnostic kit comprising means according to any of claims 8-14. 

31. Use, in the manufacture of means for assessing whether an individual has a 
predisposition to osteoporosis, of PGR primers adapted to amplify a region of 
a TGF-/J1 promoter. 
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32. Use, in the manufacture of means for assessing whether an Individual has a 
predisposition to cancer, of PGR primers adapted to amplify a region of a TGF- 
/?1 promoter. 

33. Use, in the manufacture of means for assessing whether afi individual has a 
predisposition to atherosclerosis, of PGR primers adapted to amplify a region 
of a TGF-/?1 promoter. 

34. Use, in the manufacture of means for assessing whether an Individual has a 
predisposition to immune disorder, of PGR primers adapted to amplify a region 
of a TGF-/?1 promoter. 

35. A method of diagnosis of predisposition to osteoporosis comprising 
determining a genetic disposition to osteoporosis. 

36. A method of diagnosis of predisposition to cancer comprising determining a 
genetic disposition to cancer. 

37. A method of diagnosis of predisposition to atherosclerosis comprising 
determining a genetic disposition to atherosclerosis. 

38. A method of diagnosis of predisposition to immune disorder comprising 
determining a genetic disposition to immune disorder. 

38. A probe for identification of a polymorphism of a TGF-/J1 promoter comprising 
a section of DNA of the promoter in which guanine at position -800bp has 
been replaced by adenine, or cytosine at position -509bp has been replaced 
by thymine, or a section of DNA complementary thereto. 



39. 



A probe according to claim 38 being at least 15 nucleotides in length. 
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